Your browser doesn't support javascript.
Show: 20 | 50 | 100
Results 1 - 14 de 14
Filter
1.
CEUR Workshop Proceedings ; 3395:361-368, 2022.
Article in English | Scopus | ID: covidwho-20232900

ABSTRACT

Determining sentiments of the public with regard to COVID-19 vaccines is crucial for nations to efficiently carry out vaccination drives and spread awareness. Hence, it is a field requiring accurate analysis and captures the interest of many researchers. Microblogs from social media websites such as Twitter sometimes contain colloquial expressions or terminology difficult to interpret making the task a challenging one. In this paper, we propose a method for multi-label text classification for the track of”Information Retrieval from Microblogs during Disasters (IRMiDis)” presented by the”Forum of Information Retrieval Evaluation” in 2022, related to vaccine sentiment among the public and reporting of someone experiencing COVID-19 symptoms. The following methodologies have been utilised: (i) Word2Vec and (ii) BERT, which uses contextual embedding rather than the fixed embedding used by conventional natural language models. For Task 1, the overall F1 score and Accuracy are 0.503 and 0.529, respectively, placing us fourth among all the teams, while for Task 2, they are 0.740 and 0.790, placing us second among all the teams who submitted their work. Our code is openly accessible through GitHub. 1 © 2022 Copyright for this paper by its authors.

2.
7th Arabic Natural Language Processing Workshop, WANLP 2022 held with EMNLP 2022 ; : 511-514, 2022.
Article in English | Scopus | ID: covidwho-2304479

ABSTRACT

Propaganda content has seen massive spread in the biggest social media networks. Major global events such as Covid-19, presidential elections, and wars have all been infested with various propaganda techniques. In participation in the WANLP 2022 Shared Task(Alam et al., 2022), this paper provides a detailed overview of our machine learning system for propaganda techniques classification and its achieved results. The task was carried out using pre-trained transformer based models: ARBERT and MARBERT. The models were fine-tuned for the downstream task in hand: multilabel classification of Arabic tweets. According to the results, MARBERT and ARBERT attained 0.562 and 0.567 micro F1-score on the development set of subtask 1. The submitted model was MARBERT which attained a 0.597 micro F1-score and got the fifth rank. © 2022 Association for Computational Linguistics.

3.
2022 International Conference on Electrical Engineering and Sustainable Technologies, ICEEST 2022 ; 2022.
Article in English | Scopus | ID: covidwho-2297523

ABSTRACT

COVID-19 is one the most lethal virus, causing millions of death to date. It was initially detected in Wuhan, China. It then spread rapidly around the globe, which resultantly created major setbacks in the public health sector. The reason of millions of deaths is not only due to the virus itself but it is also linked to peoples' mental state, and sentiments triggered by the fear of the virus. These sentiments are predominantly available on posts/tweets on social media. This paper presents a novel approach for exploratory data analysis of twitter to understand the emotions of general public;country wise, and user wise. Firstly K-Means clustering is employed for topic modeling to categorize the emotions in each tweet. Further supervised machine learning techniques are used to categorize the multi-label tweets. This research concluded that Fear was the most common emotion in twitter discussion. Furthermore, we classified the dataset by performing decision tree (DT), logistic regression (LR), and support vector machine (SVM), finally this paper concluded the results of classification, which shows that SVM can attain better classification accuracy (99%) for COVID-19 text classification. © 2022 IEEE.

4.
Comput Biol Med ; 157: 106683, 2023 05.
Article in English | MEDLINE | ID: covidwho-2264789

ABSTRACT

-Thoracic disease, like many other diseases, can lead to complications. Existing multi-label medical image learning problems typically include rich pathological information, such as images, attributes, and labels, which are crucial for supplementary clinical diagnosis. However, the majority of contemporary efforts exclusively focus on regression from input to binary labels, ignoring the relationship between visual features and semantic vectors of labels. In addition, there is an imbalance in data amount between diseases, which frequently causes intelligent diagnostic systems to make erroneous disease predictions. Therefore, we aim to improve the accuracy of the multi-label classification of chest X-ray images. Chest X-ray14 pictures were utilized as the multi-label dataset for the experiments in this study. By fine-tuning the ConvNeXt network, we got visual vectors, which we combined with semantic vectors encoded by BioBert to map the two different forms of features into a common metric space and made semantic vectors the prototype of each class in metric space. The metric relationship between images and labels is then considered from the image level and disease category level, respectively, and a new dual-weighted metric loss function is proposed. Finally, the average AUC score achieved in the experiment reached 0.826, and our model outperformed the comparison models.


Subject(s)
Deep Learning , X-Rays , Image Processing, Computer-Assisted/methods , Thorax , Semantics
5.
Front Bioinform ; 1: 709951, 2021.
Article in English | MEDLINE | ID: covidwho-2089808

ABSTRACT

Development of vaccines and therapeutic antibodies to deal with infectious and other diseases are the most perceptible scientific interventions that have had huge impact on public health including that in the current Covid-19 pandemic. From inactivation methodologies to reverse vaccinology, vaccine development strategies of 21st century have undergone several transformations and are moving towards rational design approaches. These developments are driven by data as the combinatorials involved in antigenic diversity of pathogens and immune repertoire of hosts are enormous. The computational prediction of epitopes is central to these developments and numerous B-cell epitope prediction methods developed over the years in the field of immunoinformatics have contributed enormously. Most of these methods predict epitopes that could potentially bind to an antibody regardless of its type and only a few account for antibody class specific epitope prediction. Recent studies have provided evidence of more than one class of antibodies being associated with a particular disease. Therefore, it is desirable to predict and prioritize 'peptidome' representing B-cell epitopes that can potentially bind to multiple classes of antibodies, as an open problem in immunoinformatics. To address this, AbCPE, a novel algorithm based on multi-label classification approach has been developed for prediction of antibody class(es) to which an epitope can potentially bind. The epitopes binding to one or more antibody classes (IgG, IgE, IgA and IgM) have been used as a knowledgebase to derive features for prediction. Multi-label algorithms, Binary Relevance and Label Powerset were applied along with Random Forest and AdaBoost. Classifier performance was assessed using evaluation measures like Hamming Loss, Precision, Recall and F1 score. The Binary Relevance model based on dipeptide composition, Random Forest and AdaBoost achieved the best results with Hamming Loss of 0.1121 and 0.1074 on training and test sets respectively. The results obtained by AbCPE are promising. To the best of our knowledge, this is the first multi-label method developed for prediction of antibody class(es) for sequential B-cell epitopes and is expected to bring a paradigm shift in the field of immunoinformatics and immunotherapeutic developments in synthetic biology. The AbCPE web server is available at http://bioinfo.unipune.ac.in/AbCPE/Home.html.

6.
3rd Natural Legal Language Processing, NLLP 2021 ; : 46-62, 2021.
Article in English | Scopus | ID: covidwho-2046909

ABSTRACT

The COVID-19 pandemic has witnessed the implementations of exceptional measures by governments across the world to counteract its impact. This work presents the initial results of an on-going project, EXCEPTIUS, aiming to automatically identify, classify and compare exceptional measures against COVID-19 across 32 countries in Europe. To this goal, we created a corpus of legal documents with sentence-level annotations of eight different classes of exceptional measures that are implemented across these countries. We evaluated multiple multi-label classifiers on a manually annotated corpus at sentence level. The XLM-RoBERTa model achieves highest performance on this multilingual multi-label classification task, with a macro-average F1 score of 59.8%. © 2021 Association for Computational Linguistics.

7.
MediaEval 2021 Workshop, MediaEval 2021 ; 3181, 2021.
Article in English | Scopus | ID: covidwho-2011296

ABSTRACT

In this paper, we present our participation to the MediaEval-2021 challenge on fake news detection about coronavirus related Tweets. It consists in three subtasks that can be seen as multi-labels classification problems we solved with transformer-based models. We show that each task can be solved independantly with mutiple monotasks models or jointly with an unique multitasks model. Moreover, we propose a prompt-based model that has been finetuned to generate classifications from a pre-trained model based on DistilGPT-2. Our experimental results show the multitask model to be the best to solve the three tasks. Copyright 2021 for this paper by its authors.

8.
45th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR 2022 ; : 3154-3164, 2022.
Article in English | Scopus | ID: covidwho-1973879

ABSTRACT

Convincing people to get vaccinated against COVID-19 is a key societal challenge in the present times. As a first step towards this goal, many prior works have relied on social media analysis to understand the specific concerns that people have towards these vaccines, such as potential side-effects, ineffectiveness, political factors, and so on. Though there are datasets that broadly classify social media posts into Anti-vax and Pro-Vax labels, there is no dataset (to our knowledge) that labels social media posts according to the specific anti-vaccine concerns mentioned in the posts. In this paper, we have curated CAVES, the first large-scale dataset containing about 10k COVID-19 anti-vaccine tweets labelled into various specific anti-vaccine concerns in a multi-label setting. This is also the first multi-label classification dataset that provides explanations for each of the labels. Additionally, the dataset also provides class-wise summaries of all the tweets. We also perform preliminary experiments on the dataset and show that this is a very challenging dataset for multi-label explainable classification and tweet summarization, as is evident by the moderate scores achieved by some state-of-the-art models. © 2022 ACM.

9.
2nd International Conference on Digital Futures and Transformative Technologies, ICoDT2 2022 ; 2022.
Article in English | Scopus | ID: covidwho-1922690

ABSTRACT

In recent years, the rapid growth of data in healthcare has prompted a lot of interest in artificial intelligence (AI). Powerful AI algorithms are essential for extracting information from medical data and assisting clinicians in establishing quick and accurate diagnoses of a variety of ailments. In the current COVID-19 outbreak, critically ill patients were intubated and various medical tubes, including an endotracheal tube (ETT), were implanted to protect the airways. The Nasogastric tube (NGT) is used for feeding, whereas the Central Venous Catheter (CVC) is utilized for a variety of medical operations. The adoption of medical protocols by doctors to ensure proper tube installation is a major issue. Manual examination of CXR pictures takes time and frequently leads to misinterpretation. This research aims to create an Automated Medical Tube Detection System that can detect misplaced tubes from chest x-rays (CXR) using deep learning. As a result, using chest x-rays to detect poorly positioned tubes can save lives. On CXR the proposed CNN-based EfficientNet architecture efficiently detects and classifies incorrectly positioned tubes. After detailed experimentation, we were able to achieve 0.95 average area under the ROC curve (AUC). © 2022 IEEE.

10.
Cognitive Computation ; : 16, 2022.
Article in English | Web of Science | ID: covidwho-1885505

ABSTRACT

Nowadays, the global COVID-19 situation is still serious, and the new mutant virus Delta has already spread all over the world. The chest X-ray is one of the most common radiological examinations for screening catheters and diagnosis of many lung diseases, which plays an important role in assisting clinical diagnosis during the outbreak. This study considers the problem of multi-label catheters and thorax disease classification on chest X-ray images based on computer vision. Therefore, we propose a new variant of pyramid vision Transformer for multi-label chest X-ray image classification, named MXT, which can capture both short and long-range visual information through self-attention. Especially, downsampling spatial reduction attention can reduce the resource consumption of using Transformer. Meanwhile, multi-layer overlap patch (MLOP) embedding is used to tokenize images and dynamic position feed forward with zero paddings can encode position instead of adding a positional mask. Furthermore, class token Transformer block and multi-label attention (MLA) are utilized to offer more effective processing of multi-label classification. We evaluate our MXT on Chest X-ray14 dataset which has 14 disease pathologies and Catheter dataset containing 11 types of catheter placement. Each image is labeled one or more categories. Compared with some state-of-the-art baselines, our MXT can yield the highest mean AUC score of 83.0% on the Chest X-ray14 dataset and 94.6% on the Catheter dataset. According to the ablation study, we can obtain the following results: (1) The proposed MLOP embedding has a better performance than overlap patch (OP) embedding layer and non-overlap patch (N-OP) embedding layer that the mean AUC score is improved 0.6% and 0.4%, respectively. (2) Our demonstrate dynamic position feed forward can replace the traditional position mask which can learn the position information, and the mean AUC increased by 0.6%. (3) The mean AUC score by the designed MLA is more 0.2% and 0.6% than using the class token and calculating the mean scores of all tokens. The comprehensive experiments on two datasets demonstrate the effectiveness of the proposed method for multi-label chest X-ray image classification. Hence, our MXT can assist radiologists in diagnoses of lung diseases and check the placement of catheters, which can reduce the work pressure of medical staff.

11.
34th Australasian Joint Conference on Artificial Intelligence, AI 2021 ; 13151 LNAI:332-343, 2022.
Article in English | Scopus | ID: covidwho-1782718

ABSTRACT

There are many ways machine learning and big data analytics are used in the fight against the COVID-19 pandemic, including predictions, risk management, diagnostics, and prevention. This study focuses on predicting COVID-19 patient shielding—identifying and protecting patients who are clinically extremely vulnerable from coronavirus. This study focuses on techniques used for the multi-label classification of medical text. Using the information published by the United Kingdom NHS and the World Health Organisation, we present a novel approach to predicting COVID-19 patient shielding as a multi-label classification problem. We use publicly available, de-identified ICU medical text data for our experiments. The labels are derived from the published COVID-19 patient shielding data. We present an extensive comparison across 12 multi-label classifiers from the simple binary relevance to neural networks and the most recent transformers. To the best of our knowledge this is the first comprehensive study, where such a range of multi-label classifiers for medical text are considered. We highlight the benefits of various approaches, and argue that, for the task at hand, both predictive accuracy and processing time are essential. © 2022, Springer Nature Switzerland AG.

12.
Diagnostics (Basel) ; 12(3)2022 Mar 16.
Article in English | MEDLINE | ID: covidwho-1760432

ABSTRACT

Background and Motivation: Cardiovascular disease (CVD) causes the highest mortality globally. With escalating healthcare costs, early non-invasive CVD risk assessment is vital. Conventional methods have shown poor performance compared to more recent and fast-evolving Artificial Intelligence (AI) methods. The proposed study reviews the three most recent paradigms for CVD risk assessment, namely multiclass, multi-label, and ensemble-based methods in (i) office-based and (ii) stress-test laboratories. Methods: A total of 265 CVD-based studies were selected using the preferred reporting items for systematic reviews and meta-analyses (PRISMA) model. Due to its popularity and recent development, the study analyzed the above three paradigms using machine learning (ML) frameworks. We review comprehensively these three methods using attributes, such as architecture, applications, pro-and-cons, scientific validation, clinical evaluation, and AI risk-of-bias (RoB) in the CVD framework. These ML techniques were then extended under mobile and cloud-based infrastructure. Findings: Most popular biomarkers used were office-based, laboratory-based, image-based phenotypes, and medication usage. Surrogate carotid scanning for coronary artery risk prediction had shown promising results. Ground truth (GT) selection for AI-based training along with scientific and clinical validation is very important for CVD stratification to avoid RoB. It was observed that the most popular classification paradigm is multiclass followed by the ensemble, and multi-label. The use of deep learning techniques in CVD risk stratification is in a very early stage of development. Mobile and cloud-based AI technologies are more likely to be the future. Conclusions: AI-based methods for CVD risk assessment are most promising and successful. Choice of GT is most vital in AI-based models to prevent the RoB. The amalgamation of image-based strategies with conventional risk factors provides the highest stability when using the three CVD paradigms in non-cloud and cloud-based frameworks.

13.
Knowledge-Based Systems ; : 107853, 2021.
Article in English | ScienceDirect | ID: covidwho-1568907

ABSTRACT

Aortic dissection is a rapid and critical cardiovascular disease. The automatic segmentation and detection of related organs and lesions in CT volumes of aortic dissection provide great help for the rapid diagnosis and treatment of aortic dissection. However, the diagnosis of aortic dissection involves multi-organ and lesion segmentation, which is also a multi-label segmentation problem. It faces many challenges, such as small target scale, variable location of the true and false lumen, and complex judgment. To solve these problems, this paper proposes a deep model (MOLS-Net) to segment and detect aortic dissection from CT volumes quickly and automatically. First, the sequence feature pyramid attention module correlates CT image sequence features of different scales and guides the current image segmentation by exploring the correlation between slices. Secondly, combine the spatial attention module and the channel attention module in the decoder of the network to strengthen the model’s positioning accuracy of the target area and the feature utilization. Thirdly, this paper designs a multi-label classifier for the inter-class relationship of multi-label segmentation of aortic dissection and realizes multi-label segmentation on the end-to-end network. In this paper, we evaluate MOLS-Net on multiple datasets (self-made aortic dissection segmentation dataset and COVID-19 CT segmentation dataset), and the results show that the proposed method is superior to other state-of-the-art methods.

14.
J Biomed Inform ; 116: 103728, 2021 04.
Article in English | MEDLINE | ID: covidwho-1131454

ABSTRACT

BACKGROUND: Diagnostic or procedural coding of clinical notes aims to derive a coded summary of disease-related information about patients. Such coding is usually done manually in hospitals but could potentially be automated to improve the efficiency and accuracy of medical coding. Recent studies on deep learning for automated medical coding achieved promising performances. However, the explainability of these models is usually poor, preventing them to be used confidently in supporting clinical practice. Another limitation is that these models mostly assume independence among labels, ignoring the complex correlations among medical codes which can potentially be exploited to improve the performance. METHODS: To address the issues of model explainability and label correlations, we propose a Hierarchical Label-wise Attention Network (HLAN), which aimed to interpret the model by quantifying importance (as attention weights) of words and sentences related to each of the labels. Secondly, we propose to enhance the major deep learning models with a label embedding (LE) initialisation approach, which learns a dense, continuous vector representation and then injects the representation into the final layers and the label-wise attention layers in the models. We evaluated the methods using three settings on the MIMIC-III discharge summaries: full codes, top-50 codes, and the UK NHS (National Health Service) COVID-19 (Coronavirus disease 2019) shielding codes. Experiments were conducted to compare the HLAN model and label embedding initialisation to the state-of-the-art neural network based methods, including variants of Convolutional Neural Networks (CNNs) and Recurrent Neural Networks (RNNs). RESULTS: HLAN achieved the best Micro-level AUC and F1 on the top-50 code prediction, 91.9% and 64.1%, respectively; and comparable results on the NHS COVID-19 shielding code prediction to other models: around 97% Micro-level AUC. More importantly, in the analysis of model explanations, by highlighting the most salient words and sentences for each label, HLAN showed more meaningful and comprehensive model interpretation compared to the CNN-based models and its downgraded baselines, HAN and HA-GRU. Label embedding (LE) initialisation significantly boosted the previous state-of-the-art model, CNN with attention mechanisms, on the full code prediction to 52.5% Micro-level F1. The analysis of the layers initialised with label embeddings further explains the effect of this initialisation approach. The source code of the implementation and the results are openly available at https://github.com/acadTags/Explainable-Automated-Medical-Coding. CONCLUSION: We draw the conclusion from the evaluation results and analyses. First, with hierarchical label-wise attention mechanisms, HLAN can provide better or comparable results for automated coding to the state-of-the-art, CNN-based models. Second, HLAN can provide more comprehensive explanations for each label by highlighting key words and sentences in the discharge summaries, compared to the n-grams in the CNN-based models and the downgraded baselines, HAN and HA-GRU. Third, the performance of deep learning based multi-label classification for automated coding can be consistently boosted by initialising label embeddings that captures the correlations among labels. We further discuss the advantages and drawbacks of the overall method regarding its potential to be deployed to a hospital and suggest areas for future studies.


Subject(s)
COVID-19 , Clinical Coding/methods , Neural Networks, Computer , SARS-CoV-2 , COVID-19/epidemiology , Clinical Coding/statistics & numerical data , Deep Learning , Electronic Health Records/statistics & numerical data , Humans , Medical Informatics , Pandemics/statistics & numerical data , United Kingdom/epidemiology
SELECTION OF CITATIONS
SEARCH DETAIL